Nonparametric Tests of Differences in Medians: Comparison of the Wilcoxon–Mann–Whitney and Robust Rank–Order Tests
نویسنده
چکیده
The nonparametric Wilcoxon–Mann–Whitney test is commonly used by experimental economists for detecting differences in central tendency between two samples. This test is only theoretically appropriate under certain assumptions concerning the population distributions from which the samples are drawn, and is often used in cases where it is unclear whether these assumptions hold, and even when they clearly do not hold. Fligner and Pollicello’s (1981) robust rank–order test is a modification of the Wilcoxon–Mann–Whitney test, designed to be appropriate in more situations than Wilcoxon–Mann–Whitney. This paper uses simulations to compare the performance of the two tests under a variety of distributional assumptions. The results are mixed. The robust rank–order test tends to yield too many false positive results for medium–sized samples, but this liberalness is relatively invariant across distributional assumptions, and seems to be due to a deficiency of the normal approximation to its test statistic’s distribution, rather than the test itself. The performance of the Wilcoxon–Mann–Whitney test varies hugely, depending on the distributional assumptions; in some cases, it is conservative, in others, extremely liberal. The tests have roughly similar power. Overall, the robust rank–order test performs better than Wilcoxon–Mann–Whitney, though when critical values for the robust rank–order test are not available, their relative performance depends on the underlying distributions, the sample sizes, and the level of significance used. Journal of Economic Literature classifications: C14, C12, C90
منابع مشابه
Inflation of Type I Error Rates by Unequal Variances Associated with Parametric, Nonparametric, and Rank-Transformation Tests
It is well known that the two-sample Student t test fails to maintain its significance level when the variances of treatment groups are unequal, and, at the same time, sample sizes are unequal. However, introductory textbooks in psychology and education often maintain that the test is robust to variance heterogeneity when sample sizes are equal. The present study discloses that, for a wide vari...
متن کاملIntroduction to biostatistics: Part 5, Statistical inference techniques for hypothesis testing with nonparametric data.
Specific statistical tests are used when the null hypothesis (H0) is to be tested using nonparametric nominal or ordinal data. With nominal data, experimental results are expressed by proportions or frequencies. Chi-square or related tests (the Fisher's exact test or the rows by columns test) are appropriate for testing H0 with nominal data. Ordinal data permit arrangement of statistical result...
متن کاملRobust nonparametric tests for the two-sample location problem
We construct and investigate robust nonparametric tests for the twosample location problem. A test based on a suitable scaling of the median of the set of differences between the two samples, which is the Hodges-Lehmann shift estimator corresponding to the Wilcoxon two-sample rank test, leads to higher robustness against outliers than the Wilcoxon test itself, while preserving its efficiency un...
متن کاملRobust Nonparametric Testing for Causal Inference in Observational Studies
We consider the decision problem of making causal conclusions from observational data. Typically, using standard matched pairs techniques, there is a source of uncertainty that is not usually quantified, namely the uncertainty due to the choice of the experimenter: two different reasonable experimenters can easily have opposite results. In this work we present an alternative to the standard non...
متن کاملComparative Power Of The Independent t, Permutation t, and WilcoxonTests
The nonparametric Wilcoxon Rank Sum (also known as the Mann-Whitney U) and the permutation t-tests are robust with respect to Type I error for departures from population normality, and both are powerful alternatives to the independent samples Student's t-test for detecting shift in location. The question remains regarding their comparative statistical power for small samples, particularly for n...
متن کامل